57 research outputs found

A Bulk-Parallel Priority Queue in External Memory with STXXL

Author: GS Brodal
J Singler
JS Vitter
L Arge
MC Pinotti
N Deo
P Sanders
PJ Varman
R Dementiev
Publication date: 01/01/2015

We propose the design and an implementation of a bulk-parallel external memory priority queue to take advantage of both shared-memory parallelism and high external memory transfer speeds to parallel disks. To achieve higher performance by decoupling item insertions and extractions, we offer two parallelization interfaces: one using "bulk" sequences, the other by defining "limit" items. In the design, we discuss how to parallelize insertions using multiple heaps, and how to calculate a dynamic prediction sequence to prefetch blocks and apply parallel multiway merge for extraction. Our experimental results show that in the selected benchmarks the priority queue reaches 75% of the full parallel I/O bandwidth of rotational disks and 65% of SSDs, or the speed of sorting in external memory when bounded by computation. Comment: extended version of SEA'15 conference paper

arXiv.org e-Print Archive

Crossref

KITopen

Near-Optimal Computation of Runs over General Alphabet via Non-Crossing LCE Queries

Author: C Hohlweg
CSJA Nash-Williams
D Kosolobov
GS Brodal
H Barcelo
J Fischer
M Crochemore
M Giraud
SJ Puglisi
W Rytter
Publication date: 01/01/2016

Longest common extension queries (LCE queries) and runs are ubiquitous in algorithmic stringology. Linear-time algorithms computing runs and preprocessing for constant-time LCE queries have been known for over a decade. However, these algorithms assume a linearly-sortable integer alphabet. A recent breakthrough paper by Bannai et al. (SODA 2015) showed a link between the two notions: all the runs in a string can be computed via a linear number of LCE queries. The first to consider these problems over a general ordered alphabet was Kosolobov (Inf. Process. Lett., 2016), who presented an $O(n (\log n)^{2/3})$-time algorithm for answering $O(n)$ LCE queries. This result was improved by Gawrychowski et al. (accepted to CPM 2016) to $O(n \log \log n)$ time. In this work we note a special non-crossing property of LCE queries asked in the runs computation. We show that any $n$ such non-crossing queries can be answered on-line in $O(n \alpha(n))$ time, which yields an $O(n \alpha(n))$-time algorithm for computing runs.
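
For readers unfamiliar with the term, the sketch below shows what a single LCE query returns. It is a naive character-by-character scan written only for illustration, not the algorithm of the paper; the function name `lce` and the 0-based indexing are assumptions made here.

```python
def lce(s: str, i: int, j: int) -> int:
    """Naive longest common extension: the length of the longest
    common prefix of the suffixes s[i:] and s[j:] (0-based indices).

    Illustrative only: each query costs time proportional to its
    answer, whereas the abstract above concerns answering n queries
    in near-linear total time.
    """
    n = len(s)
    k = 0
    # Extend the match while both positions stay inside the string
    # and the characters agree (only equality tests are used here).
    while i + k < n and j + k < n and s[i + k] == s[j + k]:
        k += 1
    return k


if __name__ == "__main__":
    # "abababa"[0:] and "abababa"[2:] share the prefix "ababa".
    print(lce("abababa", 0, 2))
    # "abcabd"[0:] and "abcabd"[3:] share only "ab".
    print(lce("abcabd", 0, 3))
```

The link to periodicity is direct: if `lce(s, i, i + p)` equals `k`, then the substring `s[i : i + p + k]` has period `p`, which is why runs can be found by asking LCE queries.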

arXiv.org e-Print Archive

Crossref

King's Research Portal

Hal-Diderot

HAL - UPEC / UPEM

A sub-cubic time algorithm for computing the quartet distance between two general trees

Author: Anders K Kristensen
BL Allen
C Christiansen
Christian NS Pedersen
D Bryant
D Coppersmith
DF Robinson
G Estabrook
GS Brodal
Jesper Nielsen
M Steel
M Stissing
MS Waterman
Thomas Mailund
Publication venue: BioMed Central

Crossref

PubMed Central

Succinct Data Structures for Families of Interval Graphs

Author: A Bar-Noy
A Farzan
DZ Chen
GS Brodal
J Fischer
JC Yang
JI Munro
KS Booth
LC Aleardi
M Habib
MC Golumbic
P Bose
P Hanlon
P Zhang
R Diestel
S Benser
S Klavzar
SR Finch
TH Cormen
Publication date: 01/08/2019

We consider the problem of designing succinct data structures for interval graphs with $n$ vertices while supporting degree, adjacency, neighborhood and shortest path queries in optimal time in the $\Theta(\log n)$-bit word RAM model. The degree query reports the number of incident edges to a given vertex in constant time, the adjacency query returns true if there is an edge between two vertices in constant time, the neighborhood query reports the set of all adjacent vertices in time proportional to the degree of the queried vertex, and the shortest path query returns a shortest path in time proportional to its length, thus the running times of these queries are optimal. Towards showing succinctness, we first show that at least $n \log n - 2n \log\log n - O(n)$ bits are necessary to represent any unlabeled interval graph $G$ with $n$ vertices, answering an open problem of Yang and Pippenger [Proc. Amer. Math. Soc. 2017]. This is augmented by a data structure of size $n \log n + O(n)$ bits while supporting not only the aforementioned queries optimally but also capable of executing various combinatorial algorithms (like proper coloring, maximum independent set etc.) on the input interval graph efficiently. Finally, we extend our ideas to other variants of interval graphs, for example, proper/unit interval graphs, k-proper and k-improper interval graphs, and circular-arc graphs, and design succinct/compact data structures for these graph classes as well along with supporting queries on them efficiently.

arXiv.org e-Print Archive

Crossref

SNU Open Repository and Archive

Computing Covers under Substring Consistent Equivalence Relations

Author: A Amir
A Apostolico
BS Baker
C Iliopoulos
CS Iliopoulos
D Breslauer
D Moore
DE Knuth
G Gourdel
GS Brodal
J Kim
M Christou
M Kubica
T Ehlers
Y Li
Y Matsuoka
Publication date: 30/07/2020

Covers are a kind of quasiperiodicity in strings. A string $C$ is a cover of another string $T$ if any position of $T$ is inside some occurrence of $C$ in $T$. The shortest and longest cover arrays of $T$ have the lengths of the shortest and longest covers of each prefix of $T$, respectively. The literature has proposed linear-time algorithms computing longest and shortest cover arrays taking border arrays as input. An equivalence relation $\approx$ over strings is called a substring consistent equivalence relation (SCER) iff $X \approx Y$ implies (1) $|X| = |Y|$ and (2) $X[i:j] \approx Y[i:j]$ for all $1 \le i \le j \le |X|$. In this paper, we generalize the notion of covers for SCERs and prove that existing algorithms to compute the shortest cover array and the longest cover array of a string $T$ under the identity relation will work for any SCERs taking the accordingly generalized border arrays. Comment: 16 pages
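
The definition of a cover can be made concrete with a small brute-force check. The sketch below handles only the identity relation (plain string equality), not general SCERs, and it is far slower than the linear-time algorithms the abstract refers to; the names `is_cover` and `shortest_cover_array` are hypothetical and chosen here for illustration.

```python
def is_cover(c: str, t: str) -> bool:
    """Return True if every position of t lies inside some
    occurrence of c in t (identity relation only)."""
    if not c or len(c) > len(t):
        return False
    covered_until = 0  # positions t[0:covered_until] are covered so far
    start = t.find(c)
    while start != -1:
        if start > covered_until:
            return False  # a gap that no occurrence of c reaches
        covered_until = start + len(c)
        start = t.find(c, start + 1)  # overlapping occurrences allowed
    return covered_until == len(t)


def shortest_cover_array(t: str) -> list[int]:
    """Brute force: for each prefix of t, the length of its shortest
    cover. A prefix always covers itself, so a value always exists."""
    result = []
    for i in range(1, len(t) + 1):
        prefix = t[:i]
        result.append(
            next(l for l in range(1, i + 1) if is_cover(prefix[:l], prefix))
        )
    return result


if __name__ == "__main__":
    print(is_cover("aba", "abaababaaba"))
    print(shortest_cover_array("ababa"))
```

For example, "aba" covers "abaababaaba" because its occurrences at positions 0, 3, 5 and 8 (0-based) leave no position uncovered, while "aba" does not cover "abab" because the final character is outside every occurrence.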

arXiv.org e-Print Archive

Crossref

Reconstructing phylogenies from noisy quartets in polynomial time with a high success probability

Author: A Ben-Dor
AC Davison
BME Moret
C Jordan
D Pelleg
DL Swofford
G Wu
Gang Wu
GD Vedova
GS Brodal
Guohui Lin
J Gramm
Jia-Huai You
K Strimmer
M Csűrös
Ming-Yang Kao
MY Kao
N Saitou
PE Kearney
PL Erdős
SK Kannan
T Jiang
TH Jukes
V Berry
WM Fitch
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: In recent years, quartet-based phylogeny reconstruction methods have received considerable attention in the computational biology community. Traditionally, the accuracy of a phylogeny reconstruction method is measured by simulations on synthetic datasets with known "true" phylogenies, while little theoretical analysis has been done. In this paper, we present a new model-based approach to measuring the accuracy of a quartet-based phylogeny reconstruction method. Under this model, we propose three efficient algorithms to reconstruct the "true" phylogeny with a high success probability.
Results: The first algorithm can reconstruct the "true" phylogeny from the input quartet topology set without quartet errors in $O(n^2)$ time by querying at most $(n - 4) \log(n - 1)$ quartet topologies, where $n$ is the number of the taxa. When the input quartet topology set contains errors, the second algorithm can reconstruct the "true" phylogeny with a probability approximately $1 - p$ in $O(n^4 \log n)$ time, where $p$ is the probability for a quartet topology being an error. This probability is improved by the third algorithm to approximately $\frac{1}{1 + q^2 + \frac{1}{2} q^4 + \frac{1}{16} q^5}$, where $q = \frac{p}{1 - p}$, with running time of $O(n^5)$, which is at least 0.984 when $p < 0.05$.
Conclusion: The three proposed algorithms are mathematically guaranteed to reconstruct the "true" phylogeny with a high success probability. The experimental results showed that the third algorithm produced phylogenies with a higher probability than its aforementioned theoretical lower bound and outperformed some existing phylogeny reconstruction methods in both speed and accuracy.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Higher levels of glutamate in the associative-striatum of subjects with prodromal symptoms of schizophrenia and patients with first-episode psychosis

Author: A Abi-Dargham
A Breier
A Di Costanzo
A Graff-Guerrero
AA Grace
AR West
AR Yung
Ariel Graff-Guerrero
B Moghaddam
B Saraceno
C Abbott
C Cepeda
C de la Fuente-Sandoval
C Jahshan
C Pantelis
Camilo de la Fuente-Sandoval
D Mamo
D Mayer
David Mamo
DC Javitt
DF Horrobin
DP Auer
ES Lutkenhoff
FA Middleton
FR Sharp
FX Vollenweider
G Segovia
GS Smith
HM Olbrich
HN David
J De Keyser
J Hietala
J O’Neill
J Théberge
JA Lieberman
JA Stanley
JD Schmahmann
Jesús Ramírez-Bermúdez
JM Stone
JR Bustillo
JW Olney
KJ Friston
KS Cadenhead
L Farde
LM Rowland
LS Kegeles
LT van Elst
M Camps
M Carlsson
M Laruelle
M Sato
MB First
MS Keshavan
MS Levine
NV Kulagina
O Mawlawi
OD Howes
P Brodal
P Ohrmann
P Seeman
P Tibbo
Pablo León-Ortiz
PB Barker
PN Jayakumar
PS Ariyannur
R Bartha
R Eluri
R Hurd
RA Edden
Rafael Favila
RG Steen
RM Kelly
RP Dum
S Aalto
S Kapur
S Lehericy
S Ruhrmann
SE Chua
SE Purdon
SJ Borgwardt
SJ Wood
SW Provencher
Sylvana Stephano
TD Cannon
TE Bates
TJ Miller
TK Rajji
V Clementi
V Villalta-Gil
Y Zhang
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2011

The glutamatergic and dopaminergic systems are thought to be involved in the pathophysiology of schizophrenia. Their interaction has been widely documented and may have a role in the neurobiological basis of the disease. The aim of this study was to compare, using proton magnetic resonance spectroscopy (1H-MRS), glutamate levels in the precommissural dorsal-caudate (a dopamine-rich region) and the cerebellar cortex (negligible for dopamine) in the following: (1) 18 antipsychotic-naïve subjects with prodromal symptoms and considered to be at ultra high-risk for schizophrenia (UHR), (2) 18 antipsychotic-naïve first-episode psychosis patients (FEP), and (3) 40 age- and sex-matched healthy controls. All subjects underwent a 1H-MRS study using a 3 Tesla scanner. Glutamate levels were quantified and corrected for the proportion of cerebrospinal fluid and percentage of gray matter in the voxel. The UHR and FEP groups showed higher levels of glutamate than controls, without differences between UHR and FEP. In the cerebellum, no differences were seen between the three groups. The higher glutamate level in the precommissural dorsal-caudate and not in the cerebellum of UHR and FEP suggests that a high glutamate level (a) precedes the onset of schizophrenia, and (b) is present in a dopamine-rich region previously implicated in the pathophysiology of schizophrenia. Peer-reviewed

OAR@UM

Crossref

PubMed Central

DON content in oat grains in Norway related to weather conditions at different growth stages

Crossref

Range Minimum Query Indexes in Higher Dimensions

Author: A Amir
B Chazelle
D Harel
ED Demaine
GS Brodal
J Fischer
J Vuillemin
K Sadakane
M Golin
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Range minimum queries (RMQs) are essential in many algorithmic procedures. The problem is to prepare a data structure on an array to allow for fast subsequent queries that find the minimum within a range in the array. We study the problem of designing indexing RMQ data structures which only require sub-linear space and access to the input array while querying. The RMQ problem in one-dimensional arrays is well understood with known indexing data structures achieving optimal space and query time. The two-dimensional indexing RMQ data structures have received the attention of researchers recently. There are also several solutions for the RMQ problem in higher dimensions. Yuan and Atallah [SODA’10] designed a brilliant data structure of size O(N) which supports RMQs in a multi-dimensional array of size N in constant time for a constant number of dimensions. In this paper we consider the problem of designing indexing data structures for RMQs in higher dimensions. We design a data structure of size O(N) bits that supports RMQs in constant time for a constant number of dimensions. We also show how to obtain trade-offs between the space of indexing data structures and their query time.
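
As background on the problem statement, the following is a minimal sketch of a classic one-dimensional RMQ structure, the sparse table. It is not the structure from the paper: it stores O(n log n) indices rather than O(N) bits and handles a single dimension only. The class name `SparseTableRMQ` is an assumption made for this example. Like the indexing structures discussed above, it stores positions and reads the input array at query time.

```python
class SparseTableRMQ:
    """Sparse table for 1-D range minimum queries.

    table[k][i] holds the index of a minimum of a[i : i + 2**k].
    Preprocessing takes O(n log n) time and space; each query
    takes O(1) time. Ties resolve to the leftmost minimum.
    """

    def __init__(self, a):
        self.a = list(a)
        n = len(self.a)
        self.table = [list(range(n))]  # level 0: windows of length 1
        k = 1
        while (1 << k) <= n:
            prev = self.table[k - 1]
            half = 1 << (k - 1)
            row = []
            for i in range(n - (1 << k) + 1):
                # Combine the two half-windows of length 2**(k-1).
                left, right = prev[i], prev[i + half]
                row.append(left if self.a[left] <= self.a[right] else right)
            self.table.append(row)
            k += 1

    def query(self, lo: int, hi: int) -> int:
        """Index of a minimum of a[lo..hi], both ends inclusive."""
        if not 0 <= lo <= hi < len(self.a):
            raise IndexError("invalid range")
        k = (hi - lo + 1).bit_length() - 1  # largest power of two that fits
        # Two overlapping windows of length 2**k cover the whole range.
        left = self.table[k][lo]
        right = self.table[k][hi - (1 << k) + 1]
        return left if self.a[left] <= self.a[right] else right


if __name__ == "__main__":
    rmq = SparseTableRMQ([5, 2, 4, 7, 1, 3])
    print(rmq.query(0, 2), rmq.query(0, 5))
```

The overlap of the two windows is harmless because taking a minimum is idempotent, which is what makes the constant-time query possible.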

Crossref

DI-fusion

Submatrix Maximum Queries in Monge Matrices Are Equivalent to Predecessor Search

Author: A Aggarwal
A Amir
A Farzan
B Chazelle
ED Demaine
GS Brodal
ML Fredman
MM Klawe
P Gawrychowski
RE Burkard
Y Nekrich
Publication venue: Springer Science and Business Media LLC

Crossref